Stream-DBSCAN: A Streaming Distributed Clustering Model for Water Quality Monitoring

نویسندگان

چکیده

With the increasing use of wireless sensor networks in water quality monitoring, an enormous amount streaming data is generated by widely deployed sensors. However, current batch mode used for analysis can no longer meet diverse combination monitoring indicators and requirement timely results on all-weather basis. To overcome these challenges analyze a large quickly accurately, we propose stream-DBSCAN distributed stream processing clustering model. First, real-time streams are processed using computing framework Flink. Then, DBSCAN algorithm applied to cluster each dataset as different dimension cluster. Finally, time distribution characteristics same analyzed identify variation rules. The system extract noise points sudden deterioration quality. We tested model datasets three indices, pH, ammonia nitrogen (NH4N), turbidity, Yantai Menlou Reservoir from May August 2019. demonstrate that efficiently perform data. By analyzing results, found daily pollution events consistent with actual situation.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Quality-Centric Data Model for Distributed Stream Management Systems

It is challenging for large-scale stream management systems to return always perfect results when processing data streams originating from distributed sources. Data sources and intermediate processing nodes may fail during the lifetime of a stream query. In addition, individual nodes may become overloaded due to processing demands. In practice, users have to accept incomplete or inaccurate quer...

متن کامل

Holistic distributed stream clustering for smart grids

Smart grids consist of millions of automated electronic meters that will be installed in electricity distribution networks and connected to servers that will manage grid supervision, billing and customer services. World sustainability regarding energy management will definitely rely on such grids, so smart grids need also to be sustainable themselves. This sustainability depends on several rese...

متن کامل

Entropy-based Consensus for Distributed Data Clustering

The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...

متن کامل

DCF: An Efficient Data Stream Clustering Framework for Streaming Applications

Streaming applications, such as environment monitoring and vehicle location tracking require handling high volumes of continuously arriving data and sudden fluctuations in these volumes while efficiently supporting multidimensional historical queries. The use of the traditional database management systems is inappropriate because they require excessive number of disk I/O in continuously updatin...

متن کامل

Streaming-Data Algorithms for High-Quality Clustering

Streaming data analysis has recently attracted attention in numerous applications including telephone records, web documents and clickstreams. For such analysis, single-pass algorithms that consume a small amount of memory are critical. We describe such a streaming algorithm that e ectively clusters large data streams. We also provide empirical evidence of the algorithm's performance on synthet...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13095408